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Abstract. The paper deals with balances and imbalances in Arnoux-Rauzy words. We 
£N| i provide sufficient conditions for C-balancedness, but our results indicate that even a char- 

acterization of 2-balanced Arnoux-Rauzy words on a 3-letter alphabet is not immediate. 

o 

Q . 1. Introduction 

O 

C\| j Among infinite words with low factor complexity, Arnoux-Rauzy words, introduced in [2] , 

play a prominent role. Recall that factor complexity consists in counting the number of 
i_J ■ factors of a given length, and by low complexity we mean here that its growth is at most 
pL( \ linear. A seminal family of low factor complexity words is provided by the Sturmian 
words (see for instance (TTJ EH H]), an d by their finite counterpart, namely Christoffel 
words [H [lEl El HHJ H] . Sturmian words are defined in purely combinatorial terms, but their 
associated symbolic dynamical systems also have a very natural geometric description: they 
! are natural codings of rotations of the circle. Arnoux-Rauzy sequences were introduced 
in |2j in an attempt to find a class of words generalizing properties of Sturmian words. 
Again, Arnoux-Rauzy words are defined in purely combinatorial terms (see Definition [1] 
below), and they admit geometric representations as natural codings of 6-interval exchange 
transformations on the circle. We quote [5j [3] as illustrations of common behavior shared 
by Sturmian and Arnoux-Rauzy words concerning the relations between bifix codes and 
subgroups of the free group. 

A wide array of literature is devoted to the study of the combinatorial, ergodic and 
geometric properties of Arnoux-Rauzy words, which belong to the family of episturmian 
words [14J. The language of Arnoux-Rauzy words can be generated by iterating a finite 
set of substitutions; this property, which is called S'-adic, will be the main viewpoint of 
our study; see Theorem [2] below. The S'-adic generation is governed by a multidimen- 
sional continued fraction algorithm of simultaneous approximation described and studied 
for instance in [3 El M, EE] • 

Despite the fact that they were introduced as generalizations of Sturmian words, Arnoux- 
Rauzy words display a much more complex behavior, which is not yet fully understood; 
this is highlighted in [11] . Some of the properties shared by purely substitutive Arnoux- 
Rauzy words sustain the similarity with Sturmian words. For instance, Arnoux-Rauzy 
substitutions are known to be Pisot [1] and thus to generate finitely balanced Arnoux- 
Rauzy words. Here, finitely balanced means that they are C-balanced for some C > 0; see 
Definition [6] below. This is the case of the Tribonacci word, which is the Arnoux-Rauzy 
word fixed by the substitution 1 i — >■ 12, 2 i — >■ 13, 3 h> 1; see for instance [22j. Recall that 
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Sturmian words are 1-balanced. Furthermore, purely substitutive Arnoux-Rauzy words are 
natural codings of toral translations; see [7] for a proof. More generally, all Arnoux-Rauzy 
words were expected to be natural codings of toral translations, a property that implies 
finite balance; see the discussions in [121 El EH] for more details. The first striking fact 
contradicting this idea came from |12j . where examples of Arnoux-Rauzy sequences that 
are not finitely balanced were constructed. 

The aim of the present paper is to try to understand balances and imbalances in Arnoux- 
Rauzy words, continuing the study performed in [12]. We provide sufficient conditions that 
guarantee finite balance. In particular, we prove that bounded partial quotients in the S- 
adic expansion imply finite balance (Theorem[7J), but we also exhibit examples of 2-balanced 
Arnoux-Rauzy words with unbounded partial quotients (Corollary [TU]). Our results show 
that a characterization of 2-balanced Arnoux-Rauzy words on a 3-letter alphabet is not 
at hand. Indeed, the occurrence of certain patterns as prefixes of the S'-adic expansion 
prevent 2-balance (see Theorem [TT|) . but the image by an Arnoux-Rauzy substitution of 
an Arnoux-Rauzy word that is not 2-balanced can be a 2-balanced sequence. 

Note that balance properties of Arnoux-Rauzy words, and more generally of episturmian 
words, have also been investigated in view of Fraenkel's conjecture; see for instance [20] 
and see also the survey on balanced words [23] for more on this conjecture. Moreover, 
if superimposition properties of Christoffel words, such as considered in [TJJ], are now 
well understood, much remains to be done in this direction for alphabets with more than 
2 letters and for epiChristoffel words such as developped in [IB]- 

The present paper is organized as follows. After recalling the required definitions in 
Section [21 we state the main results of the paper in Section [31 Proofs are given in Sec- 
tion HI with Section 14.11 devoted to balance results, while Section 14.21 shows how to create 
imbalances. 

2. Definitions 

Let A = {1,2, ... ,d} be a finite alphabet. A substitution a over the alphabet A is an 
endomorphism of the free monoid A*. For any word w in the free monoid A* (endowed 
with the concatenation as operation), \w\ denotes the length of w, and \w\j stands for the 
number of occurrences of the letter j in the word w. A factor of a (finite or infinite) word u 
is defined as the concatenation of consecutive letters occurring in oj. A factor w of u is 
said to be right special (resp. left special) if there exist at least two distinct letters a, b of 
the alphabet A such that wa and wb (resp. aw and bw) are factors of u>. 

Definition 1 (Arnoux-Rauzy words). Let A = {1,2, ... ,d}. An infinite word u G A N 
is an Arnoux-Rauzy word if all its factors occur infinitely often, and if for all n it has 
(d — l)n + 1 factors of length n, with exactly one left special and one right special factor of 
length n. 

One checks that Arnoux-Rauzy words are uniformly recurrent, that is, all factors occur 
with bounded gaps. The symbolic dynamical system (X U ,T) associated with a given 
Arnoux-Rauzy word u is defined as the set X u of infinite words that have the same language 
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as u, endowed with the shift map T that sends an infinite word onto the infinite word from 
which the first letter has been removed. 

In order to work with Arnoux-Rauzy words, we will use their representations as S'-adic 
words according to the terminology of p31 EI] • We first define the set S of Arnoux-Rauzy 
substitutions as S = {cr.£ : i g A}, with 

0"i : i !->• i, j \-> ji for j G A \ {i} . 

Theorem 2 ([2])- An infinite word u G A n is an Arnoux-Rauzy word if and only if its set 
of factors coincides with the set of factors of a sequence of the form 

lim a lQ a h ■ • -a in (l), 

n— >oo 

where the sequence (i n )n>o £ A N is such that every letter in A occurs infinitely often 
in (i„) n >o. Furthermore, such a sequence (i n )n>o £ A N is uniquely defined for a given u. 

For any given Arnoux-Rauzy word, the sequence {i n ) n >o is called the 5-directive word 
ofu. All the Arnoux-Rauzy words that belong to the dynamical system (X W ,T) have the 
same S -directive word. 

Remark 3. // one takes the set S' = {a,- : i G A}, with 

a ■ : i H> i, j f-> ij for j G A \ {i} 
then Arnoux-Rauzy words of the form 

lim cr'„a', • • ■ er- (1) 



In l\ 
n— >oo 



are called standard or characteristic Arnoux-Rauzy words. We choose to work here with 
the set S that proves to be more convenient for handling balance results. 

The following definition is inspired by the fact that Sturmian words also admit S'-adic 
representations governed by the usual continued fraction algorithm; see [T7J [5T] for details. 

Definition 4 (Weak and strong partial quotients). Let u be an Arnoux-Rauzy word with 
S-directive sequence {i m )m>o- Write 

where j n G A, k n > 1, and j n ^ j n+ i for all n > 0. The powers k n are called weak partial 
quotients. 

Let (ni)£>o be the increasing sequence of integers satisfying n = 0, 

\i"n,( ; ^ri£+l ; • • • i inp+i } A and {iri£ j ^ni+li • • • ; ^n^ +1 — 1 / 7^ 

for all £ > 0. The quantity (ne + i — ni) is called strong partial quotient. 

Notation 5. Let u> be an Arnoux-Rauzy word with S-directive sequence (i m )m>o- For 
every integer m > 0, we define as the following Arnoux-Rauzy word with S-directive 
sequence (z n ) n >m- 

o/ m ) = lim a- a . . . . cr, (1). 



n— >oo 



Lastly, we introduce the notion of finite balance. 
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Definition 6 (C-balance). Let C G N. A pair of words u,v G A* with \u\ = \v\ is 
C-balanced if 

—C < \u\j — \v\j < C for all j G A. 

A word u) G A N is C-balanced if all pairs of factors u,v of u with \u\ = \v\ are C-balanced. 
A word u G A n is said to be finitely balanced if there exists C G N such that it is C- 
balanced. 

3. Results 

We now restrict ourselves to 3-letter Arnoux-Rauzy words (i.e., d = 3) for technical 
combinatorial reasons. 

Note that Sturmian sequences are all 1-balanced (and thus finitely balanced) words. 
This provides even a characterization of Sturmian words: Sturmian words are exactly 
the binary 1-balanced words that are not eventually periodic. Three-letter words that 
are 1-balanced are completely characterized in [15] ; in particular, Arnoux-Rauzy words 
cannot be 1-balanced, see also |20j. The so-called Tribonacci word, which is the Arnoux- 
Rauzy word fixed by the substitution 1 i — >- 12, 2 i — >• 13, 3 (->• 1, with 5-directive sequence 
123123 ■ • ■ , has been proved to be 2-balanced in [22] ■ Nevertheless, Arnoux-Rauzy words 
are now known to be in general far from being 2-balanced. They need not even be finitely 
balanced [HI Hi]. 

A natural question is to understand whether the fact that partial quotients are bounded 
can guarantee (and even characterize) finite balancedness. We have seen that there are 
two types of partial quotients. The following theorems provide sufficient conditions for 
(2/i+ Imbalance and 2-balance respectively. 

Theorem 7. Let u be an Arnoux-Rauzy word with d = 3 and S -directive sequence (i m ) m >o- 
If the weak partial quotients are bounded by h, i.e., if we do not have i m = i m+ i = ■ ■ ■ = 
i m +h for any m > 0, then u is (2h+l) -balanced. 

Theorem 8. Letu be an Arnoux-Rauzy word with d = 3 and S- directive sequence {i m )m>o- 

Ifim+2 {im,Wl} 

(1) for all m > 1 such that i m -i = i m 7^ i m +\ and 

(2) for all m > 2 such that i m -2 = i m ^ i m +\ = i m -i> 
then u is 2-balanced. 

Remark 9. The conditions of Theorem on the S-directive sequence can also be stated 
in terms of a subshift of finite type: Let X be the set of words {1121, 1122, 12121, 12122} 
together with all the words that are obtained from one of these four words by a permutation 
of the letters 1, 2, and 3. // {i m )m>o contains no factor in X, then u is 2-balanced. 

For the proofs of Theorems [7] and [U which will be given in Section I4.1[ we use meth- 
ods that are typical for the S'-adic framework: We work not only with the word u but 
simultaneously with all the words by 'desubstituting' the word a/" 1 -* with respect 
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Note that Theorem [HJ applies in particular to the Tribonacci word; a generalization of 
its 5-directive sequence 123123 • • ■ gives the following balancedness result. 

Corollary 10. There exist 2-balanced Arnoux-Rauzy words oj with unbounded weak (and 
thus unbounded strong) partial quotients. 

Proof. If the iS-directive sequence {i m ) m >o satisfies zqZj • • • = l s °231 Sl 23 • • • for any sequence 
(s n )n>o with s n > 1, then the assumptions of Theorem [8] are satisfied. In other words, any 
Arnoux-Rauzy word with such an iS-directive sequence is 2-balanced. □ 

The following result, which is proved in Section I4.2[ can be considered as a partial 
converse of Theorem [HJ It shows in particular that no element of the set X defined in 
Remark M can be omitted: If a word in X is a prefix of an 5-directive sequence, then the 
corresponding Arnoux-Rauzy word is not 2-balanced. (By symmetry of S, the statement 
of Theorem [TT] remains of course true if we apply a permutation of the alphabet A to 
{1, 3}* 1 2* 1 2 {1, 2}.) This extends a result of [12] . where it was shown that iS-directive 
sequences starting with 112213 give Arnoux-Rauzy words that are not 2-balanced. For 
arbitrary C, prefixes preventing C-balancedness are also given in [12] . 

Theorem 11. Let u be an Arnoux-Rauzy word with d = 3 and S- directive sequence (i m )m>o 
starting with a word in {1, 3}* 1 2* 1 2 {1, 2}. Then to is not 2-balanced. 

Theorem [TT] also shows that Theorem [7] is optimal for h = 1: There exist Arnoux-Rauzy 
words with all weak partial quotients equal to 1 that are not 2-balanced, e.g. those where 
the 5-directive sequence starts with 12121. 

While it is possible to exclude C-balancedness by looking at a finite prefix of the S- 
directive sequence, it was shown in [12] that one cannot guarantee C-balancedness in this 
way: For any given imbalance C, for any word w G {1,2,3}*, for any infinite iS-directive 
sequence (i m )m>o, there exists a word w' G {1,2,3}* such that the 5-directive sequence 
ww'ioii ■ ■ ■ gives an Arnoux-Rauzy word that is not C-balanced. 

Proposition [12] below gives another evidence that the situation is quite contrasted: It is 
possible that u is 2-balanced, u/ 1 ) is not 2-balanced, and u/ m ) is again 2-balanced for all 
m > 2. Thus, the image by an Arnoux-Rauzy substitution of an Arnoux-Rauzy word that 
is not 2-balanced can be a 2-balanced sequence. In particular, we cannot replace {1,3}* 
in Theorem [TT] by the full set of words {1, 2, 3}*. 

Proposition 12. Let u be an Arnoux-Rauzy word with d = 3 and S -directive sequence 
{im)m>o starting with 211213 such that the conditions of Theorem^ are violated only for 
m — 2. Then u is 2-balanced (and ouW is 2 -balanced for all m > 2), but ui 1 - 1 ' is not 
2-balanced. 

Remark 13. The conditions of Proposition 1TB can be weakened. Indeed, we will use in the 
proof only that (im) m >o starts with 211213 and that is 2 -balanced for allmE {2,3,4,6} 
to show that u is 2-balanced (and is not 2-balanced). 

Summing up, a complete description of 2-balanced Arnoux-Rauzy words on 3 letters in 
terms of their iS-directive sequences seems to be difficult. Note that our proofs strongly 
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rely on the fact that we work with 3-letter alphabets. The case of a larger alphabet is 
certainly even more intricate. 

4. Proofs 

4.1. Balances. In order to relate balance properties of u/ m+1 ) to the ones of oj( m \ we first 
need the following 'desubstitution' property. For any factor u of cu^ m \ m > 0, one easily 
checks that there exists a unique factor of u^ m+1 \ denoted by u', that satisfies 

(!) 4 ff imM=<i 

where 

f 1 if u starts with i m , j if u ends with i m , 

\ otherwise, \ 1 otherwise. 

Note that the definition of u' depends on m. (More precisely, it depends only on i m .) For 
simplicity, we omit this dependence. 

Example 14. Let u = 12131212. Assume that i m — 1. One has 8 — 1, e — 1, and 
u' = 2322, with lai(2322) = ul. 

We thus introduce the following notation. 

Notation 15. For a factor u of uj( m \ m > 0, let u' be the unique factor of u^ m+lS) satisfy- 
ing (Qp. Recursively, let vS n ' be the factor of tu^ m+r ^ given by u^ ' = u and u( n+1 ^ = (u^)' , 
n>0. 

Example 16. We continue Example [7^] with u = 12131212. We assume that i m = 1, 
i m+ i = i m +2 = 2. One has = u, vF> — v! — 2322, = 32, and = 3. Indeed, 
la! («(!)) = u (°)l, 2a 2 {uW) = u^, and a 2 {u {3) ) = u^. 

The following lemma provides information on the relations between the length and the 
number of occurrences of letters in u and it'. 

Lemma 17. Let u be a factor of tu^ m \ m > 0. Then there exists 5 U E { — 1, 0, 1} such that 

Kl = Him + $U, 

Hj ifj€A\{i m }i 

2 \ U \i m - \ U \ + $u ifj=im- 

Furthermore, \u'\ < \u\ if \u\ > 2, and \u'\ G {0,1} if \u\ G {0,1}. 

Example 18. We continue Example 14 with u = 12131212, i m = 1 and v! = 2322. One 
has \u'\ = 4 = \u\\, with 5 U = 0. Furthermore, \u'\i = = 2|w|i — \u\, \u'\ 2 = \u\ 2 , 
|w'|a = l M l3- 

Proof. By the definition of u', we have \u'\j = \u\j for all j G A\ {i m }, and \u'\ = \u\ im + 5 U 
with S u = e — 5. It follows that \u'\i m = 2 \u\i m — \u\ + 8 U . This ends the proof of the first 
part of the statement. 

If \u\ = 0, then we have vl = i m . If \u\ = \u\ im > 1, then we have 5 U = —1, thus \u'\ < \u\. 
Otherwise, if u does not only contain the letter i m , we have \u'\ < with \u'\ = \u\ if and 



w\ 
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only if \u\ im = \u\ — 1, 5 U = 1. Since 8 U — 1 means that w does not start or end with i, 
\u\i — \u\ — 1 implies that |m| = 1. □ 



m ; 



The following two lemmas allow us to easily exhibit a pair of factors of the same length 
displaying some imbalances when sufficiently large imbalances occur for factors that are 
not necessarily of the same length. 

Lemma 19. Let u,v G A* and j G A such that 

\ u \j ~ Mi > C + max (0, \u\ — \v\) . 
Then there exist factors u of u and v of v with \u\ = \v\ = min(|-u|, \v\) and \u\j — \v\j > C. 



Proof. We first assume \u\ < \v\, hence \u\j — \v\j > C. Let u = u and take for v any factor 
of v with the same length as u (hence \v\ = \u\). Then \u\j — \v\j > \u\j — \v\j > C . 

We now assume |w| > \v\. Take v = v and any factor u of u with \u\ = \v\. One has 
\u\j + (|w| — \v I) > \u\j. We get \u\j — \v\j > \u\j — (\u\ — \v\) — \v\j > C. □ 

Lemma 20. Let u,v be factors of o~i(w) for some w & A* such that \u\ > \v\ and 

\u\ — \v\ 



u\j — \v\i > C + 



for some j £ A \ {i} 



2 

Then there exist factors u of u and v of v with \u\ = \v\ and \u\j — \v\j > C . 

Proof. Let v = v and u be the prefix (or suffix) of u with \u\ = \v\. Since there are no two 
consecutive letters j in o~i{w) and thus in u, we obtain that \u\j < \u\j + ^ ^ ~|. This 
implies \u\j — \v\j > \u\j — — \v\j > C. □ 

Proof of Theorem Let u be an Arnoux-Rauzy word with d = 3 and weak partial quo- 
tients that are bounded by h. We will prove that not only co, but all are (2/i+l)- 
balanced. We will work by contradiction and consider a pair of factors of equal length that 
is not (2/i + rebalanced. We will furthermore take it with minimal length among all such 
pairs, and for all sequences u/ m ). This minimality assumption will prove to be crucial and 
provide the desired contradiction. 

So, suppose that u/ m ) is not (2/i+ rebalanced for some m > 0. Let u, v be a pair of factors 
of u)( m \ m > 0, with \u\ = \v\, that is not (2/i+l)-balanced. Assume w.l.o.g. that \u\ is 
minimal, that is, for any t > 0, for any pair of factors x, y of uj^> with |x| = \y\ < \u\ = \v\, 
and for any letter /c, one has | \x\k — |y|fe| < 2h + 1. This implies that | \u\j — \v\j\ = 2h + 2 
for some j G A. Moreover, we can assume w.l.o.g. that m = and that 

\u\j — \v\j — 2h + 2 . 

We now want to reach a contradiction. 

We can easily rule out the case j = i . Indeed, Lemma [T71 gives \u'\ io — \v'\ io = 2 {2h + 
2) + o~ u — 8v an d \u'\ — \v'\ = 2h + 2 + 5 U — 8 V , thus Lemma dH] provides in turn factors 
u,v of u/ 1 - 1 with \u\ = \v\ = \v'\ that are not (2/i+ rebalanced, with \v'\ < \v\ (note that 
M — \ u \ > \ u \j — \ v \j + 2/i + 2 > 2), contradicting the minimality of \u\. 
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Assume from now on that j 7^ io, thus \u'\j — \v'\j = 2h + 2. If \u'\ < \v'\, then Lemma [T9l 
gives factors u, v of with \u\ = \v\ = \v'\ < \u\ and \u\j — \v\j > 2h + 2, contradicting 
again the minimality of \u\. Therefore, we must have \u'\ > \v'\. By Lemma [T71 this implies 
in particular that \u\ io — \v\ io > —1. The equality |m| = \v\ then gives that \u\k — \v\k < 
— 2h — 1 for the remaining third letter k € A \ {io,j}- 

Observe that we also have \u\k — \v\h > —2h — 1. For if \u\k — \v\k < —2h — 2, then 
by Lemma [T7I one has — \v'\ k < —2h — 2, and Lemma [T9l together with the fact that 
\u'\ > \v'\ gives factors u,v of a/ 1 -* with \u\ = \v\ = \v'\ < \u\ and \v\k — \u\ k > 2h + 2, 
contradicting the minimality of \u\. Thus we obtain that 

\ u \i ~ \ v \io = _1 > \u\j - \v\j = 2h + 2 , \u\k - \v\ k = -2h - 1 , 

with A = {i ,j, k}. 

Let n > 1 be such that io — i\ — ■ ■ ■ — z n _i 7^ i n . Then 

|« (/) li-|v W li = 2/i + 2 and |w (£) | fc - \v {e) \ k = -2h - 1 for < £ < n. 

Therefore, the above arguments showing that \u'\ > \v'\ now show that > 
for < I < n (note that \u^\ < \u\ by Lemma [T71) . 

For 1 < i < n, we do not have |u^|i — I^^U = — 1- Nevertheless, we calculate 

|^+D| _ < | u W|. o _ \ v (t) lio + 2 = (ti W| _ + j 

for < £ < n. By the assumption of the theorem, we have n < h, thus 

(2) 1 < |u (n) | - \v {n) \ < n < h. 

We cannot have i n — k, since this would imply |w (n+1) | - \v {n+1) \ < -2h + 1 < and 
_ — 2h + 2, hence the existence of factors u, v of u^ n+1 ^ with \u\ = \v\ < \u\ 

and — \v\j > 2h + 2 by Lemma [T9l 

Since i n 7^ k, we have i = i\ — • • • — z n _i 7^ i n = j. Then, by Lemma [T71 

| u ("+i)| - > 2h. 

Let r > 1 be such that i n = i n+ i = ■ ■ ■ = i n +r-i 7^ i n +r- According to Lemma [TTJ, 
\u { % = \u(%, \v®\ k = \v (n ^\ k , and similarly \u% = \u^\ io , \v% = \v^\ io , for 
n < £ < n + r. We thus deduce from these equalities and from ([2]) that 

| M W|. _ |^)|. _ _ = | M (n)|. _ | v (n)|. _ (| u (n)| _ | v (n)|) > ^ + 2 

for n < i < n + r, and then, from Lemma [T71 that 

1^+1)1 _ \ v (.t+i)\ > ; _ U% _ 2 > l u W| - 1^1 + ft 
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(n+r) I 



,(n+r)| 



\U 



(n+r) I _ \ v (n+r) 



\U 



(n+r) I 



(n+r) I p _ 



(n+r) I 



(n+r) 



l) + 



> h + 2 + 



\ u (n+r)\ _ I v (n+r) I 



> 2h + 2. 



Now, Lemma [20] provides factors u, v of u/ n+r ) with I u\ = \v\ = |f( n+r ^| and \u\j — \v\j > 
2h+2. Since |^™ +r )| < \v \ by LemmafTTt we have obtained a contradiction to the minimality 
of \u\ in the last remaining case as well. Therefore, u is (2/i+l)-balanced if the weak partial 
quotients are bounded by h. □ 

Proof of Theorem^ Similarly to the proof of Theorem [TJ suppose that a/" 1 -* is not 2- 
balanced for some m > 0. Let u, v be factors of uj^ m \ m > 0, with minimal length 
such that \u\j — \v\j = 3 for some j G A, and assume w.l.o.g. m — 0. 
The proof of Theorem [7] shows that we must have 



u\ 



\U\i - \v\ 



with A = {io,j, k}, and 



\u 



(«) I 



(n)| 



,(n)| 



|w|fe - Mfc = -2 , 



1 < |u 



(ft) I 



,(ft)l 



< n , 



for n > 1 satisfying i = *i 



f n— 1 



7^ i n . Moreover, we must have i n = j, thus 



\u 



(n+l)| 



,("+1)1 



(n+l)| 



; (n+1) l > k (n) L- 



By assumption (1) of the theorem, we have either n > 2 and i 
If i n+ i = k (which holds in particular if n > 2), then Lemma [T71 gives 



n+l 



2 = 1. 
fc, or n — 1. 



,(ft+2)| 



(3) 



- \u 

,(n+l)| 



(n+2) 



fc - IF 
| u (n+l) 



(n+2) I 



>(" +2 )|) 

,(n+l)| 



— W 



(n+l) 



|) >3 



and |i/ n+z )|fc > \u^ n+2 ^\k- Thus Lemma [T9l provides factors u, v of o;( n+2 ) with 



it 



\u 



(n 



+ 2 )| and 



v\k ~~ \ u \k — 3. As |it( n+2 )| < \u\ by Lemma TT7\ this contradicts the minimality 
of \u\. 

It remains to consider the case n = 1, %2 ^ k. We have seen above that n = 1 means 
that ii = j, \u<>% - \vM\j = 3, \u^\ k - \v^\ k = -2, and \u^\ - \v^\ = 1. Let r > 1 be 

V 7^ v + i. Similarly to the proof of Theorem [3, we get 



such that j = i\ = %2 



(£)\ 



u 



(i)\ 



I) 



Ml 



,M|, - 



Wl 



,(1) 



I) 



for 1 < ^ < r + 1, hence 
,(*+!) I 



— f 



(^+i)l 



> lit 



W i 



u 
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for 1 < £ < r, thus 

| u (r+D| _ | v (r+l)| > i ; | u (r+l)| . _ . > 3 ; | u (H-l)| fc _ | v (r+l)| fc = _ 2 . 

We cannot have i r+ \ = k. Indeed, we would have |iz( r+2 '|j — |^ |j > 3 and |u' r+2 )| < 
|<y( r + 2 )| by Lemma \T7\ contradicting the minimality of \u\ by Lemma fTTH 

We thus have i r+ i = i , hence \u( r+2 ^\j — \v^ r+2 ^\j > 3 and \u^ r+2 ^\ k — \v^ r+2 ^\ k = —2 by 
Lemma [TTJ Similarly to the preceding paragraph, we exclude i r+ 2 = k. Then assumption 
(1) of the theorem forces r = 1. Let now s > 1 be such that (i —) i 2 — i 3 — ■ ■ ■ — i s +i ^ 
i s+ 2- Again, Lemma [T71 gives |?/( s+2 )|j — |i^ s+2 ^|.j > 3 and |w^ +2 ^|jfc — |v^ s+2 ' ) |fc = —2, and we 
can exclude i s+ 2 = k, i.e., we have i s+ 2 = j- By Lemma \T7\ we have 

| u (»+3)| _ |^+3)| > | u (-+2)| . _ | V (H-2)| . _ 2 > 1 , \u^\ k - \v^\ k = -2 . 

Since i = i s = i s+ i ^ i s+2 = j in case s > 2, and i = i 2 ^ i 3 = i\ = j in case 
s = 1, the assumptions of the theorem force now i s+3 = k. Replacing n by s + 2 in 
and the subsequent lines, we get factors ii,v of a>^ +4 ^ with \u\ = \v\ = |ii^ +4 - ) | < \u\ and 
\v\k ~ \Mk > 3, providing the desired contradiction to the minimality of \u\. □ 

4.2. Imbalances. In order to prove Theorem [11] and Proposition [T2| we first need a lemma 
that allows us to exhibit pairs of factors in a/" 1 -* having prescribed differences of Parikh vec- 
tors, with the Parikh vector associated with the word w being defined as (\w\i, \1v\2, \w\ 3 ). 

Lemma 21. Let u,v be factors of > 0. Then, for any 5 G {0,±1,±2}, there 

exist factors u,v of such that 

\u\im ~ \v\im = \ u \ + $ , \u\j ~ \v\j = \u\j - \v\j for all j G A \ {i m } ■ 

Proof. Clearly, o"j ra (it) and <7i m (y) are factors of u^ m \ with 

Wi m { U )\im - WiMUm = \ U \ ~ \ V \, Wim( U )\j ~ Wi m ( V )\j = \ U \j ~ \ V \j fol 3 £ 4 \ {z m }. 

Since cr im (u) and &i m {v) end with i m , and i m a im (u) and i m o im (v) are factors of uj^ m \ we 
obtain every 5 G {±1,±2} by removing i m at the end of Ci m (u) and/or Ci m (v) and/or 
appending i m in front of cr im (u) and/or o- im {v). □ 

Proof of Theorem{TJ\ We first prove that a/" 1 -* contains factors u,v whose difference of 
Parikh vectors — |tt| 2 — \v\ 2 , \u\ 3 — \v\ 3 ) is (1,2,-2) or (—1,2,-2). Starting 
from such a pair (u, v) and applying the substitution corresponding to any given word in 
{1, 3}* 1 2* 1 2 {1,2}, we will then exhibit an imbalance of 3 in uo. 

By Theorem |2j the sequence (i m )m>o contains infinitely many occurrences of 1, 2, and 3 
respectively. Therefore, for any m > 0, i m i m+ i ■ ■ ■ starts with a word in 1* 2 ^4* 2 ^4* 3 ^4* 3 
or 1*34*34*24*2. 

Assume first i m ■ • • i m +3 = 2233. Starting with the factor u = 1 of aj (m+4 ' ) and v the 
empty word, and successively applying Lemma |2"T| we obtain the following chain of vectors 
(|u|i — \v\i, \u\ 2 — | ■£> 1 2 , \u\z — l^b) f° r factors u, v of uj( n \ m + A>n> m: 

(:) - • ) ^ (i) s (i) * (i) 
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We can insert between the terms of this chain some substitutions that will not alter the 
difference vectors that have been created. Indeed, except for the last vector, any of these 
vectors is the image of itself by any substitution c?j, j G A, and some 5 G {0, ±1, ±2} as 
in Lemma HU More precisely, if (u,v) are factors of o> (n+1 ) satisfying (\u\i — \v\i, \u\ 2 — 
H2, H3 — \v\ s ) = (1,0, —1), then whatever the value of i n , there exist factors u,v of u)^ n > 
with the same difference vector (1,0,-1). The same holds for the vectors (1,0,-2) and 
(1,1,-2), while the last vector (1,2,-2) is stabilized only by o~i and cr 2 . Therefore, if 
imim+i'-- starts with a word in 1* 2 A* 2 A* 3 A* 3, then cj^ m ' contains factors u,v with 
(Hi - Hi, \u\ 2 - \v\ 2 , \u\ 3 - \v\ 3 ) = (1,2, -2). 

Symmetrically, if i m i m+ i ■ ■ ■ starts with a word in 1* 3 A* 3 A* 2 A* 2, then the chain 















1 


i*( 






^ I 

















1*1 


2 


1*1 




1 








v-2 



gives factors u, v of u^ m ' with (Hi — \v |i, \u\ 2 — \v\ 2 , \u\ 3 — \v\ 3 ) = (—1, 2, —2). This finishes 
our first step. 

Starting with factors u, v G u/ m ) such that (Hi — Hi, \u\ 2 — H2, \ u \3~ | 'i-' 1 3) = (±1, 2, —2), 
we now use the chain 





Here, (0,3,-2) is fixed by a 2 (with 5 = 2); (—1,3,-2) is fixed by 0\ (with 5 = —1) and 
by C3 (with 5 = —2), i.e., this chain can be used for any word in {1, 3}* 1 2* 1 2 1. When 
(Hi — Hi; \ u h " H2, H3 — H3) = (1,2,-2), then the first transition is of course not 
needed and we can use the rest of the chain for any word in {1, 3}* 1 2* 1 2. 

We have thus shown that, if i§i\% 2 --- starts with a word in {1, 3}* 1 2* 1 2 1, then u 
contains factors u, v such that (Hi — Hi, H2 — H2, H3 — His) = (1,3, —2), which implies 
that u is not 2-balanced. If a prefix of ioi\i 2 --- is in {1, 3}* 1 2* 1 2 2, then we get the 
same result, as ioiii 2 • ■ ■ starts now with a word in {1, 3}* 1 2* 1 2 followed by an element 
of 2^*2^*3^1*3. □ 

Proof of Proposition EH Let u> satisfy the conditions of the proposition. Since the condi- 
tions of Theorem [S] are violated only for m — 2, the Arnoux-Rauzy words u^ m \ m > 2, 
satisfy all conditions of Theorem |8] and are thus 2-balanced. By Theorem [TTJ cj« is not 
2-balanced. 

The proof of the 2-balancedness of u will be similar to the proofs of Theorems [7] and [HJ 
Supposing that ui contains a pair of factors that are 2-unbalanced, we construct from this 
pair a 2-unbalanced pair of factors of u^ m \ The only difference is that we require now 
m > 2. 

We first show that a/ 1 -* has no factors u, v with \u\ = \v\ and Hi — Hi = 3 or — \v\s = 
3. To this end, recall that the iS-directive sequence of starts with 112. The proof of 
Theorem [7] (see also the proof of Theorem [8]) shows that if a/ 1 - 1 had factors u, v with 
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\u\ = \v\ and \u\i — \v\i = 3, then the same would be true for uj^ 2 \ contradicting that uj^ is 
2-balanced. We also see from the proof of Theorem [7] that the existence of factors u, v of ui^ 1 ' 
with \u\ = \v\ and 3 — \v\s = 3 contradicts that is 2-balanced for all m G {2,3,4}. 

Now suppose that u> is not 2-balanced, i.e., there exist factors u, v of lo with \u\ = \v\ 
and \u\j — \v\j = 3 for some j G A. Recall that i Q ii • • - i 5 = 211213. 

Assume first j = 3. By Lemma (TTJ we have — M^h = | 1 3 — = 3, 

| u (i)| _ \ V W\ < \ u \ 2 - \ v \ 2 + 2, and |m (2) | - \v {2) \ < |u (1) |i - |u (1) |i + 2 = |u|i - Mi + 2. 
Since \u\ = \v\ and \u\ 3 — |v| 3 = 3, we have either \u\\ — \v\i < —2 or \u\ 2 — \v\ 2 < —2, thus 
|n( 2 )| < |t>( 2 )| or \u^\ < \v^'\. By Lemma [T9| this contradicts that u/ 2 ) is 2-balanced or 
that imbalances of size 3 do not occur in a/ 1 -* for the letter 3. 

Assume now j = 2. We deduce from Lemma [T71 that \ 2 — \ 2 > 4, — \v^\ > 1. 
In the last paragraph we have seen that \u\^ — \v\$ < —3 is impossible. This implies 
— \v\i < —1, hence < — 1 by Lemma fT71 Now we obtain by Lemma fT71 

that I li^ 2 -* I — |f/ 2 )| < 1 and M I2 — | v ( - 2 - ) I2 > 4, which by Lemma [T9l contradicts the 2- 
balancedness of u^ 2 \ 

Finally, assume j = 1 . Then Lemma [T7] provides We cannot have 

< as this contradicts by Lemma HH] that imbalances of size 3 do not occur 

in u/ 1 ) for the letter 1. Therefore, we must have \u\ 2 — \v\ 2 > — 1 by Lemma [T71 As 
MI3 _ M3 < _ 3 is impossible, we get 

|w|i - = 3, \u\ 2 - \v\ 2 = -1 , \u\ 3 - \v\ 3 = -2 , 

Using that \u^'\ > M l> Lemma IT71 gives 

\u^% - \v^% = 3 , M«| 2 - \v<% = , \u<% - \v ( % = -2 . 

As i\ = i 2 = 1, 23 = 2, further applications of Lemma ITT1 give 

M( 4 )|i - M 4) k = M (3) k - k (3) k > 2 (M (2) |i - l^ (2) |i) - (M (2) l - \v {2) \) - 2 

= ^(2)^ _ ^(2)^ + 1^(1)^ _ 1^(1)^ _ (| M (D| _ | v (l)|) _ 2 = ^(2)^ _ | w (2) (i > 3 

and, with z 4 = 1, 

M 5) | - M 5) l > 1, m (5) Is - k (5) ls = Hs - Ms = -2- 

Since i$ = 3, we can replace n by 4 and k by 3 in (J3J) and the subsequent lines, and get a 
contradiction to the 2-balancedness of cu^ 6 \ □ 
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